... POLDER Multi-Angle Data Over Boreal Regions

J. Pierce, M. Diaz-Barrios, J. Pinzon, S.L. Ustin, ..., S. Toumcs

3 National Chiao Tung University, Taiwan 
4 NASA Ames Research Center, Moffett Field, CA 93401, USA 


Abstract - This study used Support Vector Machines to classify 
multiangle POLDER data. 

I. Introduction

Boreal wetland ecosystems cover an estimated 90 x 10
ha, about 36% of global wetlands, and are a major source of
trace gas emissions to the atmosphere [1].
... percent of the global emission of methane to the atmosphere
comes from wetlands north of 40°N latitude [2]. Large
uncertainties in emissions exist because of large spatial and
temporal variation in the production and consumption of
methane. Accurate knowledge of the areal extent of open
water and inundated vegetation is critical to estimating
magnitudes of trace gas emissions. Improvements in land
cover mapping have been sought using ...
approaches [3],[4], neural networks [5], ..., and
microwave [7], examples that demonstrate the difficulties of
separating open water, inundated vegetation and dry upland
vegetation. Here we examine the feasibility of using a
support vector machine to classify POLDER data representing
open water, inundated vegetation and dry upland vegetation.

II. METHODS 

Airborne POLDER sensor [8] data were collected in the
red (665 nm) and infrared (865 nm) ...
July 1994 over the Southern Study Area (SSA) of the Boreal
Ecosystem-Atmosphere Study (BOREAS) ...
105°W in central Saskatchewan, Canada. After resampling
and registration, each POLDER pixel represented a ground
area of 150 m x 150 m that was observed in each of 16 different
view directions spaced at 5° intervals in the principal plane
±50° about nadir.

The forest cover map used as ground truth, derived from
forest cover data provided by the Saskatchewan Environment
and Resource Management, Forestry Branch-Inventory Unit,
indicated the land cover consisted of fens (6.9%), black spruce
(27.4%), jack pine (13.6%), and aspen (23.5%), grasses,
mixed land use, and open water.
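
The percentages above cover only four of the cover classes. As a quick arithmetic check, the short sketch below sums them and reports the share left for grasses, mixed land use, and open water combined. The dictionary keys are illustrative names, not identifiers from the original data set.

```python
# Cover fractions (percent) as listed in the text; the key names are illustrative.
listed_cover = {
    "fen": 6.9,
    "black spruce": 27.4,
    "jack pine": 13.6,
    "aspen": 23.5,
}

# Share of the study area accounted for by the four listed classes.
listed_total = round(sum(listed_cover.values()), 1)

# Remainder shared by grasses, mixed land use, and open water (not itemized in the text).
remainder = round(100.0 - listed_total, 1)

print(f"listed classes: {listed_total}%  remaining classes combined: {remainder}%")
```

Open water is therefore one of several classes inside a 28.6% remainder, so it is a minority class when the problem is posed as open water against everything else.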

The Support Vector Machine (SVM) classifier [9] was
designed to meet two objectives: i) that it is accurate when
assessed against a test set and ii) that we can deduce from its
design a level of confidence that the trained machine will
accurately classify comparable data outside of the training and
testing sets. Our analysis was restricted to four candidate
polynomial support vector machines: linear, quadratic, qu...
and septic (7th degree). Machines were trained to
discriminate open water from all other classes ...
is to seek to minimize the empirical risk
function allowing no error in classification. All machines were
trained on 25% of the classified data from each of the
wavelength bands, red and near infrared, of the POLDER data.
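
The training setup described above can be sketched in a few lines. This is a minimal illustration, not the authors' code. The inhomogeneous kernel form with offset 1, the per-class (stratified) draw of the 25% training subset, and all function and label names are assumptions made for the example. Only degrees 1, 2 and 7 are shown because those are the degrees legible in the text.

```python
import random

def polynomial_kernel(x, y, degree, coef0=1.0):
    """Polynomial kernel K(x, y) = (x . y + coef0) ** degree (kernel form assumed)."""
    dot = sum(a * b for a, b in zip(x, y))
    return (dot + coef0) ** degree

def water_vs_rest(labels, positive="open water"):
    """Relabel a multi-class cover map as +1 (open water) and -1 (all other classes)."""
    return [1 if lab == positive else -1 for lab in labels]

def stratified_split(labels, train_fraction=0.25, seed=0):
    """Return (train_idx, test_idx), drawing train_fraction of each class (assumed scheme)."""
    rng = random.Random(seed)
    by_class = {}
    for i, lab in enumerate(labels):
        by_class.setdefault(lab, []).append(i)
    train = []
    for idx in by_class.values():
        rng.shuffle(idx)
        k = max(1, round(train_fraction * len(idx)))
        train.extend(idx[:k])
    train_set = set(train)
    test = [i for i in range(len(labels)) if i not in train_set]
    return sorted(train), test

# Hypothetical toy cover labels, not the BOREAS data.
labels = ["open water"] * 8 + ["fen"] * 12 + ["jack pine"] * 20
y = water_vs_rest(labels)
train_idx, test_idx = stratified_split(labels)

# Kernel values for a pair of two-band (red, near infrared) feature vectors.
for degree in (1, 2, 7):
    print(degree, polynomial_kernel([1.0, 2.0], [3.0, 4.0], degree))
```

Raising the degree enlarges the family of decision surfaces the machine can represent, which is why the higher-order machines can fit the training data more closely.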

III. Results and Discussion 

Table I provides estimates of the Vapnik-Chervonenkis
(VC) dimension and the mean probability of misclassification
when applied to subsequent data as well as a conservative
estimate for the upper bound on the 95% confidence interval.
While higher order machines provide better training results,
they generally prove less robust when applied to subsequent
data. The loss of robustness is seen by the growth of the VC
dimension; the loss of confidence is seen in the growth
of the upper bound of the confidence interval. The balance of
empirical risk versus generality favors the linear machine,
with an estimated mean probability of misclassification of
subsequent data of a remarkable 2%, but with a 95%
confidence interval for the conservative upper bound of
68.23%. Both red and near infrared spectral bands are nearly
equal in predictive ability.
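
One standard estimate of the mean probability of misclassification for a support vector machine is the ratio of the number of support vectors to the number of training samples (a bound on the expected error due to Vapnik). The sketch below illustrates that ratio. The source does not confirm that Table I was computed this way, and the training-set size of 150 is an assumption, chosen only because it reproduces the 2.00% and 3.33% table entries from the 3 and 5 support vectors reported there.

```python
def expected_error_bound(n_support_vectors, n_training_samples):
    """Bound on expected misclassification probability: support vectors / training samples."""
    if n_training_samples <= 0:
        raise ValueError("n_training_samples must be positive")
    if not 0 <= n_support_vectors <= n_training_samples:
        raise ValueError("n_support_vectors must lie between 0 and n_training_samples")
    return n_support_vectors / n_training_samples

# ASSUMPTION: 150 training samples; this value is not stated in the paper.
N_TRAIN = 150

for n_sv in (3, 5):
    bound = expected_error_bound(n_sv, N_TRAIN)
    print(f"{n_sv} support vectors -> {100 * bound:.2f}%")
# prints:
# 3 support vectors -> 2.00%
# 5 support vectors -> 3.33%
```

Under this reading, a machine that needs fewer support vectors to separate the training data earns a lower expected error, which is consistent with the preference for the simpler machine in the text.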

By establishing a classifier solely based on the presence or
absence of open water, the machines were less able to
discriminate inundated vegetation from dry vegetation for
either red or infrared bands. Accurate discrimination of
inundated vegetation was achieved only by overfitting
training data, thereby sacrificing robustness of the support
vector machine. After open water has been ...
image, a support vector regression machine, rather than a
classifier, might provide a basis for discriminating the
remaining community types. Alternatively, machines that use
multispectral multiple view direction bands may provide
sufficiently more information about the surface ...
accuracy is improved. The results also demonstrate that the
septic (7th degree) machine was the most robust of the three
polynomial machines for each of the three binary classifiers of
interest.


IV. Conclusions

This study reports results of applying Support Vector
Machines to POLDER data ...
data. These results suggest that SVM can be
applied to extend our analysis to classify additional types of
inundated vegetation and non-inundated vegetation by
... water ... classified and adding
spectral bands.

Support vectors                                   5          3          5
Margin                                        3.461      112.5      5.490
Bounding Sphere                            3.856e+4  4,316,800   2.343e+8
VC Dimension                                   3877       1365    170,545
Mean Probability of Misclassification (%)      3.33       2.00       3.33
95% Confidence Interval Upper Bound (%)